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MPROIXJCTION 

Conventional  tests  of  hypotheses  and  methods  for  defining  confidence  inter- 
vals are  not  strictly  valid  unless  the  conditions  and  assumptions  for  specific 
statistical  models  are  met.  Such  conditions  and  assumptions  may  be  more  nearly 
satisfied  if  the  involved  random  variables  are  transformed  to  new  forms.  This 
procedure  is  especially  useful  when  applied  to  analysis  of  variance  for  data 
derived  from  designs  such  as  randomized  blocks  and  Latin  squares. 

Transformations  are  especially  appropriate  when  a  random  variable  is  such 
that  its  variance  is  a  function  of  its  mean  \i     so  that  c2  ■  f(u).  The  proper 
change  of  variable  would  be  one  which  produces  a  new  random  variable  whose  mean 
and  variance  are  unrelated  since  homogeneity  of  variances  is  a  common  assumption 
in  many  statistical  investigations.  Other  assumptions  in  linear  models  usually 
specify  additivity  of  effects  and  normal  and  independently  distributed  random 
errors;  that  is  e  e  K(0,  0s).     Since  nonconformity  to  linear  models  is  not  un- 
common, a  need  for  developing  appropriate  transformations  applicable  to  partic- 
ular statistical  data  is  present. 

The  usual  purpose  for  transforming  a  variable  in  analysis  of  variance  data 
is  to  change  the  scale  of  measurement  in  order  to  make  the  analysis  more  valid. 
One  condition  required  for  assessing  accuracy  in  the  ordinary  unweighted  analy- 
sis of  variance  includes  the  important  one  of  a  constant  residual  or  error  var- 
iance. If  the  variance  tends  to  change  with  the  mean  level  of  the  measurements, 
the  variance  can  only  be  stabilized  by  a  suitable  change  of  scale.  The  func- 
tional relationship  of  variance  to  mean  level  determines  the  appropriate  trans- 
formation to  be  used.  For  some  data  a  transformation  which  achieves  homogeneity 
of  error  variance  has  an  added  beneficial  effect.  Nonnormality  often  occurs 

with  heterogeneity  of  error  variances,  and  a  transformation  may  partially  correct 


both  difficulties  simultaneously.  However,  the  norrjality  assumption  is  some- 
what less  important  than  homogeneous  variances  because  it  can  be  shown  that  the 
sampl Inc  distribution  of  the  F  ratio  is  relatively  insensitive  to  moderate 
departures  from  normality. 

A  third  reason  for  transforming  may  be  to  achieve  additivity  of  effects, 
which  implies  a  linear  model  which  does  not  contain  interaction  terms.  An  addi- 
tive linear  model,  free  of  interactions,  has  particular  advantages  in  the  case 
of  fixed  and  random  effects  in  the  same  experiment.  Tukey,s  test  for  non-addi- 
tivity  can  be  used  in  part  to  determine  the  appropriate  transformation  to  ob- 
tain additivity  of  effects.  However,  in  some  cases  there  is  an  intrinsic  inter- 
action between  the  factors  which  cannot  be  considered  a  function  of  the  choice 
of  the  scale  of  measurement  and  therefore  it  is  not  always  possible  to  find  a 
transformation  which  will  eliminate  non-additivity  in  a  given  situation. 

Battle tt  (19^7)  summarises  by  stating  that  the  ideal  transformation  will 
be  one  in  which: 

(1)  The  variance  of  the  transfor^d  variate  should  be  unaffected  by  changes 
in  the  mean  level. 

(2)  The  transformed  variate  should  be  normally  distributed. 

(3)  The  transformed  scale  should  be  one  for  which  an  arithmetic  average  is 
an  efficient  estimate  of  the  true  mean  level  for  any  particular  group  of 
measurements . 

(h)    The  transformed  scale  should  be  one  for  which  real  effects  are  linear 
and  additive. 


THEORETICAL  CONSH&RATIOHB 

Consider  a  variate  x  whoae  mean  u  -  K(x)  is  a  real  variable  vith  a 
R  of  possible  values,  and  a  standard  deviation  «x  »  f(u)  is  a  non- 
constant  function  of  ».     In  order  to  stabilise  variances,  it  is  necessary  to 
find  a  function  Y  -  f (x)  such  that  both  f (x)  and 

«•  -  I  fl  -  E(X)J  2 
are  functionally  independent  of  u  for  u  on  B.  This  would  require  that 

3#  3  0? 

— *  m  0    and    rr~~  "  0  , 

for  u  on  R. 

One  approach  to  the  problem  is  from  the  relationship 
dY  »  f •(x)dx  . 
An  approximate  solution  is  available  by  a  summation  process  such  that 

Oy  -  f  ,(n)0(tt)    • 

From  this  a  rough  approximation  con  be  derived  by  setting  this  expression 
equal  to  a  constant,  say  c,  thus  obtaining 

Then  f(x)  is  an  indefinite  integral  of  3  \   •  The  results  obtained  can 

only  be  useful  if  the  application  is  found  to  be  satisfactory. 

A  transformation  Y  ■  f(x),  which  produces  a  variate  Y  such  that  its 
distribution  is  exactly  normal,  depends  upon  the  original  variate  X.  One  can 
never  exactly  transform  a  di3crct:  variate  X  into  a  variate  Y  with  a  con- 
tinuous distribution.  However  any  variate  X  with  a  continuous  distribution 
function  F(x)  can  be  transformed  into  a  normally  distributed  variate  Y  by 


the  transformation  Y  «  f(X)  defined  by  the  equation 

Y 

r(x)-    J   JLe-i/sy2^    , 
-oo       lair 

However,  the  resulting  function  Y  -  f(x)  will  not  generally  be  functionally 

Independent  of  the  mean  of  X  and  there  is  the  practical  difficulty  of  solving 

this  equation  for  Y.  Therefore  it  seems  somewhat  more  reasonable  to  seek  as- 

ya^totic  solutions  to  the  problems  of  normalization  and  stabilisation  of  variance. 

Asymptotic  solutions  depend  on  the  assumption  that  the  distribution  of  X, 

with  mean  p  ■  |i   of  X  and  range  R  ,     is  determined  by  a  parameter  n  which 

tends  to  infinity.  Other  independent  parameters  are  present  and  represented 
collectively  by  0  with  a  range  0.  The  variatc  of  interest  is  Y  ■  f(X),  in 
which  f (X)  is  functionally  independent  of  p  and  the  parameter  e    ranges  on 
R.  or  Q    in  P,  and  such  that  the  distribution  of 

f(x)  -  f(nn) 
tends  as  n  ~^oo,  to  a  normal  distribution  while 

lim  of-Ce  , 
n— ?oo 

where  C*  is  an  absolute  constant.  This  does  not  deny  the  presence  of  the 
additional  parameters  0.  The  function  f(x)  may  depend  non-trivially  on  n, 
but  since  n  is  the  only  parameter  on  which  the  distribution  of  X  depends, 
then  f(x)  must  be  functionally  independent  of  n. 

The  following  theorems,  due  to  Curtiss  (19^3),  are  helpful  in  determining 
asymptotic  solutions. 
Theorem.  }.  Let  ♦  (x)  be  a  non-negative  function  of  x  and  n,  defined 

almost  everywhere  and  Lebesque  integrable  with  respect  to  x  over  any  finite 


interval  of  the  x-axis  for  each  a  >  0.  Let 


I  «  f(x)  -  J      *n(x)dx 


where  a  is  an  arbitrary  constant.  Let  F  (z)  he  the  distribution  function 

of  the  variate 

Z  -  (X  -  Mfc)^)   • 

Suppose  further  that  a  continuous  distribution  function  F(s)  exists  such  that 

Urn  F  (*)  -  F(a) 

n-.oo  n 

for  all  values  of  z.    Then  either  one  of  the  following  two  conditions  is  a 

sufficient  condition  for  the  distribution  function  iff)  of  the  variate 

n 

V  -  f(x)  -  f(un) 

to  tend  to  F(w),  -oo  <  w  <  oo: 

(a)  To  each  w  for  which  0  <  F(w)  <  1,  there  corresponds  for  all  n 
sufficiently  large  at  least  one  root  X  ■  X   to  the  equation 

J     tQ(u)du  -  w 

and  this  root  X„  has  the  property  that 
n 

U*    (Xn  '  »WH&>  "  w  ' 

n— ^cd 

(b)  For  all  n  sufficiently  large,  *n^^  >  0>  aiid 

lim  q_(w)  ■  1 
n— ^oo 

uniformly  in  any  closed  finite  subinterval  of  the  open  interval  defined 

by  0  <  ?(w)  <  1,  where 


The  following  lemma  is  useful  in  showing  condition  (a)  in  the  theorem  is  a 
consequence  of  condition  (b). 

ha'SEA  -  If  7  (v)  is  a  non -negative  function  integrablQ  over  any  finite  inter- 
val of  values  of  w;  and  if 

lia  7   (w)  »  1 
n— >co 

uniforaly  in  any  finite  closed  subinterval  of  an  inteival  w  <  w  <  wg,  then 

for  every  value  of  w  in  this  interval  there  exists  for  all  n  sufficiently 
large  a  solution  z  •  z   of  the  equation 


I    7n(v)dv  -  w 


0 

and  the  solution    z      has  the  property  that 

lira     z    »  v     . 
n  — »  a 

Theorem  2    Let  Y  (or  f(x)),  Z,  F-U)  sad  F(z)  be  defined  as  in 


1.  Let  the  mean  and  variance  of  the  distribution  defined  by  F(z)  exist  and 

have  respective  values  0  and  e2.  Then  the  following  three  conditions,  taken 

together,  are  sufficient  that 

lim  lE(Y)  -  f(u,  )]  •  0 
n  — »  oo         a 


n — ?co 


lim 


(i)    ^(Y2)  exists  for  n  >  0,  and   lia  E(zc)  »  e* 

n— >od 


(ii)        Condition  (b)  of  Th orem  1  holds. 

(iii)      fCzl^u^)!"1  ♦  uQ)  -  f(»^)  •  0|ZI  uniformly  in    n    as     III— >co. 

{ffaaoreiq  ^         Let  the  distribution  of  a  variate    Z    depend  upon  a  parameter 

n,     let    F  (z)    be  the  distribution  function  of    Z,     and  let    F(z)     be  a  contin- 

distrilution  function  with  the  property  that 


lia    F  (a)  -  F(z)     . 
n->oo  n 


Let  aQ  be  a  function  of  n  such  that 


lim  a„  •  a  «/  0. 
n 
n— .00 


the  distribution  function  of  the  variate  v  -  a  Z  tends  as  n  -*  co  to  the 

n 

distribution  function  F(v/  a)  if  a  >  0,  and  to  the  distribution  function 

1  -  F(v/  a)  if  a  <  0.  If  the  variance  of  Z  exists  sod  tends  to  c*  as 

n^co,  then  the  variance  of  a  Z  tends  to  a*ca  as  n— >co. 

n 


ran  soiare  HOOT  tmobfobmxich 


The  variance  of  a  Poisaon  distributed  variate  is  equal  to  its  mean.  Theory 

s  that  a  square  root  transformation  >nay  prove  useful.  For  enumeration 
.,  where  values  of  the  random  variable  arise  by  counts,  the  variate  may  tend 
to  be  distributed  in  a  Poisson  fashion.  If  the  mean  a  is  large,  and  actually 
equal  to  the  variance,  we  have: 

o*(fx)  *  1/  k   .  .  . 
or  taore  twierally  (l/  k)  \    if 

0*(x)  -  \  m  . 
the  above  formula  is  an  approximation  formula  only,  and  it  is  of  some  revelance 
to  see  how  far  the  variance  of  /x  f or  a  Poisson  variate  X  is  constant,  when 
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the  mean  m    becomes  small .  The  variance  of  j  X  can  also  be  calculated  for  a 
contlnuouo  distribution  for  which  the  variance  equals  the  mean,  where  p  is 
proportional  to 

X*-VXdx  . 
Bartlett  (1936)  has  investigated  numerically  the  degree  of  approximation  for 
values  of  n  from  .5  to  15.0  in  the  cases  a  *  0  and  a  »  1/  2.  He  found 


that  the  variance  of  Ix*  (l/  2)  is  considerably  closer  to  the  limit  (1/  h) 
for  1  <  m  <  10  than  is  the  variance  of  Jx,    At  n  »  15  the  variance  of  >[x 


is  .256  and  that  of  Jx  ♦  (l/  2)  is  .2^8.  For  the  continuous  curve,  the 
variance  of  the  transform  variate  approaches  its  limit  suprisingly  quickly,  shov- 


ing a  peak  about  m  ■  1,  which  disappears  when  1  X  +  (l/  2)  is  used.  Thus  >[T 
nay  be  considered  above  m  *  0,  and  from  10  to  about  2  or  3»  J  X  ♦  (l/  2) 
is  preferable.  Below  a  mean  m  of  about  2  or  3,  an  analysis  is  not  useful 
unless  a  large  number  of  replications  is  available  from  experimentation. 

Tor  example,  let  a  variate  X  have  a  Poisson  exponential  distribution  with 
parameter  n.  If  X  is  an  arbitrary  constant,  and  if 

Qx  ♦<*        x  >  -  a 

Y  -  f<X)  -  ) 

(_  0         x  <  -  a 


then  the  distribution  of  Y  -  ]n  ♦  X  tends  as  n —* 00  to  a  normal  distribution 
which  has  mean  zero  and  variance  1/  U,  and 

lim  o*  -  1/  h     . 
n— ?oo 

For    m^  ■  n,      cr    »  fm§     it  is  known,  Curtiss  (19A3),  that  the  distribution  of 

the  reduced  variate     (X  -  n)//n    tends  to  the  reduced  normal  distribution  as 
n—  ?oo.    By  Theorem  3,  the  distribution  of  the  variate 


z  - 


*  -a       i  .  rr~ .  x  -  p 


2     Jn+a        pS" 


2/n+a 

will  tend  to  normality  as  n  — >oo,  and  the  variance  of  Y  will  tend  to  the 
value  1/  k,    which  is  also  the  variance  of  the  limiting  distribution.  Let 

x  >  -  a 


*n(x)  *  /  :  -  a 


tfcM 


Y  -  f(x)  -  J*  *n(x)dx  . 

-  a 

The  proof  of  the  above  statement  can  be  verified  by  conditions  (ii)  and  (iii) 
of  Theorem  2.  Therefore  it  is  justifiable  to  use  an  appropriate  square  root 
transformation  on  a  Poisson  variatc  X. 

In  the  analysis  of  variance  for  a  Latin  square  it  is  standard  practice  to 
that  treatment,  row,  and  column  effects  are  additive.  The  expected  yield 


X.  of  the  ith  plot,  which  receives  the  Jth  treatment  and  occurs  in  the  kth  row 
and  the  1th  column  is  written 

W-*  +  Te  +  Ri+cj  +  6gijk 

whore  u  is  the  average  yield  in  the  experiment  and  T  ,  \L    and  C.  represent 
the  effects  of  treatment,  row  and  column  for  each  plot.  Because  the  T,  B  and 

C  constants  are  required  only  to  measure  differences  between  different  treat- 
ments, rows  and  columns,  we  may  put 

It  »Er.  «£c,  "0  . 
I  g  i  *  J  j 

With  normally  and  independently  distributed  experimental  errors,  simple  equations 
arc  available  for  estimating  unknown  parameters.  Therefore,  the  analysis  should 
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be  a  good  description  of  the  combined  action  of  treatment  and  block  effects. 

Any  set  of  yields  X.  vith  expectation  m,  may  be  written  in  a  probabil- 
ity statement  as: 

-i  xi 


V 

of  the  likehood  is 

L  -  L(X.log  m.  -  m.)  -  Z  log  X  f 

t     x    x    i    x  x 

This  gives  the  mavlwim  likehood  equation  of  estimating  any  parameter  6  in  the 

for.: 

E(X.  -  m.)  9BL 
i ±_  __*  m   0 

m.     o>9 

the  summation  extends  over  all  plots  whose  expectations  involve  0. 

Poisson  variation  contains  a  weighted  sum  of  the  deviations  from 


the  observed  expected  values,  the  factor  -*-  is  introduced  by  the  weights 

mi 


1  sa± 
-*-  rr*  .  The  presence  of  this  weight  factor  makes  a  linear  model 

mi  D$ 


unsuitable . 

However,  this  weight  can  be  stabilized  and  made  constant  by  assuming  a  linear 
prediction  formula  in  the  square  roots  and  transforming  the  data  to  square  roots. 
For  the  Latin  square,  this  prediction  formula  would  be 

^i  -  *  - » *  *e  *  Ri  ♦  ci 

where 

Zt-Er.  -  Z  c  ,  «  o  . 

g  g   i  *   J  i 
Through  the  use  of  the  method  of  Lagrangian  multipliers,  the  maximum  value 


u 


of  the  Logarithm  of  the  likelihood  is  found  by  maximising 

l*\(Zt)  *f(EL)  +7(2:  c.)  . 

g  S     1  X     J  J 

Estimation  of  a  typical  constant  yields  a  transformation  of  the  form 

yl  -  ^i  ' 
When  m.  la  small,  an  approximation  procedure  similar  to  the  rules  for  finding 
square  roots  is  available  and  sometimes  more  satisfactory. 

The  suitability  of  the  linear  prediction  formula  in  a  square  root  transfor- 
mation must  be  considered  when  this  analysis  is  being  employed.  The  prediction 
formula  can  be  evaluated  by 

U\  -  a)2 
x 

which  has  a  sufficiently  close  Xa  distribution.  A  high  value  of  this  Xe 
means  that  the  prediction  formula  is  not  satisfactory,  or  that  the  experimental 
errors  are  higher  than  the  Poisson  distribution  indicates  or  that  both  causes 
are  operating.  These  effects  can  sometimes  be  separated  by  examining  whether 
the  observed  yields  deviate  from  the  expected  yields  in  a  systematic  or  a  ran- 
dom manner.  If  th ■  deviation  is  systematic,  the  prediction  formula  is  pro'oa'bl ... 
not  satisfactory. 

Although  the  square  root  transformation  is  especially  applicable  for  a 
variate  with  a  Poisson  distribution,  it  is  often  times  a  good  transformation 
in  other  situations.  For  instance  at  times  a  useful  square  root  transformation 
is  sometimes  available  for  a  variate  with  a  r  distribution. 
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I/XJARITHMIC  TRAHSFGRMAIICH 

A  logarithmic  trans  format  ion  of  a  random  variable  X  is  appropriate  if 
effects  of  an  experiment  arc  proportional  to  the  mean  instead  of  additive.  The 
standard  deviation  of  the  variable  also  tends  to  vary  directly  as  the  mean.  A 
situation  "Where 

VX    •"     |i 


or  sometimes 


<•*» 


may  call  for  a  logarithmic  transformation-  Also  frequently  there  is  a  propor- 
tional relationship  between  the  means,  variances,  and  ranges  of  the  ordinal 
variable.  A  logarithmic  transformation  frequently  makes  the  ranges  si- 
gnal and  uncorrelated  vith  the  means.  A  transformation  that  tends  to  make 
the  ranges  more  uniform  vill  also  tend  to  make  the  variances  more  uniform.  In 
an  experimental  situation  one  can  apply  Tukey'a  test  for  non-additivity  to  each 
of  the  interaction  terms  to  check  the  adequacy  of  the  logarithmic  transformation. 
Graphically  one  might  observe  a  curvilinear  relationship  between  the  — yla  mean 
and  variance  which  suggests  that  the  relationship  might  be  quadratic.  If  this 
is  the  situation,  a  logarithmic  transformation  will  frequently  make  the  mean  and 
variance  unrelated,  thus  satisfying  the  assumptions  for  the  analysis  of  variance 
and  normal  theory* 

The  asymptotic  theory  for  a  logarithmic  transformation  differs  in  some  re- 
spects from  other  transformations.  Consider  a  variate  X  with  mean  u  and 
standard  deviation  o  end  with  functional  relationship 

•  "  \K  •  «>» 

where  a  is  an  arbitrary  constant,  k  >  0,  and  the   lim  k   exists  and  is 

n— >co 
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finite.  Stabilization  of  the  variance  of  X  at  k8  leads  us  to  the  function 

y  »  log  (x  +  a),       x  >  -  a  . 
If  X  is  a  varlate  such  that 

P(x  <  -  a)  m  o  , 
then  the  corresponding  reduced  varlate 


X  - 


!k 


Zm  ^V^T 


a  distribution  function  F  (z)  such  that 


*„(-!/  H,)-0  - 

11m  k  «  k  >  0  , 
n 
n— >oo 

the  limiting  distribution  of  Z,  if  it  exists,  must  have  a  distribution  function 

P(z)  such  that 

F(-l/  k  -  0)  m  0  . 

Hence,  the  limiting  distribution  of  Z  can  never  be  normal  if  k  >  0. 

Curtiss  (l<&3)  shoved  that  If  the  reduced  varlate  Z  does  have  a  limiting 

distribution,  the  varlate 

V 

^h   n 

may  have  a  limiting  distribution  vhlch  is  not  the  same  as  that  of  W.  More 
specifically  he  states: 

l£t 

P(x  <  -  a)  -  0  , 


lim  k  •  k  >  0  , 
n— >oo 


Ifc 


and  FQ(z)  be  the  density  function  of  the  reduced  variate 


z 


x-jk 


v% + «  • 


Furthermore  let  HQ(w)  be  the  distribution  function  of  the  variate  W  given 

above.  If  a  continuous  distribution  function  F(z)  exists  such  that 

lim    F  (y)  «  F(y) 
n  — oo  " 


for  all    y,     then 


lira    H 
n      oo 


(v)  -  \ 

F(v)  ,  k  -  0 


And  in  considering  the  distribution  of  the  transform  variate    Y    the 
following  theorem  is  useful. 

Theorem  5         Under  the  hypotheses  of  theorem  h,  and  under  the  conditions  that 
the  improper  integral 

J  v^dH^w)  /or        J  k^2  [log  (1  +  kv)l2dFa(v) 

-co  ^    -l/kn 

converges  uniformly  in  n  and  that 
oo 


f'   w^w)  »  1,  »  E(w)2  , 


-00 

the  following  relations  hold: 

/CD 


*  log  (1  ♦  k  )dF(z)         k  >  0 
-l/k  *         * 


lim  E(w)  » 

a—;  00 


k  -  0 
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ao 


f      p  tlog  (1  *  ks)]2dF(a)     k  >  0 

lim  E(w*)  i 
«-®      (  l  k-0 


Proa  this  it  can  be  shown  that  If  F(z)  is  independent  of  any 
eters  9,  end  if  k  is  positive  and  has  the  same  value  for  all  variates  in  a 
given  problem,  the  transformation 
Yx  -  log  (X  +  a) 

will  yield  en  asymptotic  stabilization  of  the  variance  under  the  conditions  of 
Theorem  5-  However,  if 

k-0,      I  •  log  (X  ♦  a) 
converges  stochastically  to  log  (m^  ♦  ot).  The  variance  of  the  variate 

Y  -  log  (X  +  a) 
can  be  stated  by  the  equation 

o*  -  kg  (e(w»)  -  [E(tr)]aj   . 

The  specific  application  of  a  logarithmic  transformation  depends  somewhat 
upon  the  experimeter^  insight  into  the  particular  type  of  variation  present  in 
the  original  variate  X.  If  from  a  large  number  of  observations  the  variate  is 
essentially  bounded  from  below  and  has  a  standard  deviation  which  is  proportional 
to  the  mean  u,  the  use  of  the  transformation 

Y  -  log  (X  +  a)  , 

with  -  a  less  than  the  lower  bound  of  X,  will  usually  yield  a  normally  dis- 
tributed variate  with  the  variance  independent  of  the  value  of  u.  In  general 
there  are  a  large  number  of  distributions  where  the  :aean  and  standard  deviation 
are  proportional.  The  use  of  a  logarithmic  transformation  In  the  analysis  of 
variance  should  follow-up  vith  teste  for  departure  from  normality  for  the  ob- 
served distribution  of  transform  d  values.  In  some  situations  the  logarithmic 
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transformation  may  be  somewhat  more  successful  in  stabilizing  the  variaooe 
rather  than  normalising  the  data.  However,  it  will  be  found  that  the  logarith- 
mic transformation  is  particularly  useful  in  normalising  distributions  which 
contain  a  positive  skewness  in  the  original  data. 

In  a  situation  where  a  random  variable  tends  to  have  a  binomial  distribu- 
tion, an  angular  or  inverse  sine  transformation  is  often  effective  in  stabiliz- 
ing the  variances.  For  such  quanta!  data  one  would  have  a  binomial  population 
in  which  a  proportion  P  possesses  a  given  attribute .  Sample  estimates  of  P, 
PA,  can  be  taken  and  will  give  sample  evidence  of  the  parameter  values.  If 

these  proportions  cover  a  wide  range  of  value,  the  inverse  sine  transformation 
of  the  original  variate  would  be  particularly  relevant  prior  to  the  analysis  of 
the  data.  This  wide  range  of  proportions  cannot  always  be  accounted  for  since 
the  expectation  of  p  equals  P  and  the  variance  of  p  equals  PQ/  a,  where 

Q  •  1  -  P  . 
The  inverse  sine  transformation  is  probably  the  most  common  angular  transforma- 
tion, but  there  are  other  angular  transformations  that  would  be  particularly 
applicable  if  additional  information  is  available  about  the  original  variate. 
She  Inverse  sine  square  root  transformation  Is  of  the  general  form: 

q(x)  »  sin"  [x     . 
If  the  transformation  is  effective,  the  approximately  constant  variance  on  the 
transform  is  821/  n,  provided  that  the  inverse  sine,  which  denotes  an  angle 
is  measured  In  degrees. 

Suppose  the  basic  observations  of  the  original  variate  are  proportions 
the  mean  and  variance  are  related  in  the  following  form 
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**  -  u(i  -  n)  . 
following  transformation  is  effective  in  stabilizing  the  variances  in  such 

instances, 

YA  •  2  arcsin  [x^ 

where  X.  is  a  proportion.  For  values  of  X  close  to  zero  or  close  to  unity, 
the  following  transformation  is  reconmsnded, 


r±  -  2  arcaln  Jxi  ♦  (|  n) 

where  n  is  the  number  of  observations  upon  which  X  is  based.  Both  of  the 
•hove  transformations  are  variations  of  the  general  form  and  the  analysis  is 
facilitated  by  an  insight  into  the  original  variate  X. . 

For  a  negative  binomial  variable  X,  with  mean  m  and  exponent  k,  the 
latter  being  constant  and  known,  the  appropriate  transformation  would  be 

-1 


y  ■  sinh 


X_i^ 


k  -  2c       * 
of    c    is  roughly    3/  8    if    ra    is  large  and    k  >  2,    and 
the  variance  is  approximately    ^  t'OO*    where    +»(t)    denotes  the  second 

derivative  of    In  r(t),     a  gaum  function  determined  by  the  cumulant-generating 
function,  with  respect  to    t. 

Consider  a  variate  X  with  a  binomial  relative  frequency  distribution  and 
parameter  p  which  has  n  possible  values  0,  l/  a,  2/  n,  .  .  .  n/  n  .  For 
an  arbitrary  constant    a,     if 

j  in  sin"1  Jx  ♦a/n    ,  -a/n<x<l-a/n 

y  •  f(x)  •  s 

I  0  ,  X    <    -a/n    , 

X    >    1  -  a/  n 
irtere    T    is  measured  in  radians,  then  the  distribution  of 
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Y.  -  in  sin"1  JpT  ♦  (a/n) 

tends  as    n    '  oo     to  a  normal  distribution  which  has  mean  zero  and  variance 

1/4    and 

11a    c*  »  1/  4     . 
a—*  co 

Since 

u    -  p         and         of  -  pq/  n    ,  where  g.  -  1  -  p    , 


»n      *  X      *"*'  "    ■  mfmmm  *      *      r    ■ 

it  can  be  shown  that  the  distribution  of  the  reduced  variate 

fn(x  -p)/lpq 
will  tend  to  normality  as    n   ->oo.     By  Theorem  3,  the  distribution  of 

7.  -    Ift  (*  -  P) 


will  tend  to  normality  with  a  limiting  variance  of  1/  h,    which  is  also  the 
limiting  distribution.  Let 


—..,,.■..,.,  ..JLtf, .  St  < 

( «         «  n  n 


o  I  *  .ft  i  i  i-« 


then  one  obtains  the  integral 

Y-    f  *(x)dx  . 


-a/n 


here  the  conditions  (ii)  and  (iii)  of  Theorem  2  can  be  satisfied  justify- 
ing the  asymptotic  considerations  of  this  angular  transformation. 

Sometimes  it  is  convenient  to  express  X  as  a  percentage,  which  has  the 
effect  of  multiplying  Z  by  100.  In  this  case  the  quantity 


J~n  sin*"1  J  X  ♦  100  J  -In  sin"1 1  lOOp  +  100  ■ 

n  n 
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has  a  distribution  approaching  normality,  and  oy— >50  instead  of  1/2. 

The  choice  of  a  ■  l/  2  is  somewhat  more  useful  for  p  near  0  or  1, 
but  the  choice  a  ■  0  is  aore  suitable  if  the  estimated  p  lies  between  .3 
and  .7  for  n  »  10  which  is  somewhat  caanon.  However,  for  other  values  of 
n,  the  question  as  to  the  choice  of  a  for  the  above  transformation  for  con- 
to  normality  is  somewhat  arbitrary. 

The  analysis  following  the  angular  transformation 

Y  -  sin"1  [p  ,    where   0  <  P  <  1,   and  0  <  Y  <  90° 
be  carried  out  in  one  of  two  ways,  (l)  as  a  multiple  regression  or  (2) 

as  an  analysis  of  variance,  both  methods  being  iterative.  If  n  is  not  con- 
stant, weight  analysis  may  be  carried  out,  but  due  to  the  loss  of  orthogonality 
the  advantages  of  the  analysis  of  variance  are  to  a  large  extent  lost. 
The  angular  transformation 

Y  -  f  (x)  -  2  aresin  (x    ,    where  0  <  X  <  1  ,  and 

0  <  Y  <  v    (radian  measure) 
is  very  effective  for  the  simplification  of  variance  of  random  variables  where 
the  variance  is  dependent  on  the  mean.  However,  other  transformations  such  as 

X  «  sin*  Y    where  0  <  X  <  1   and   0  <  Y  <  90°  (degree 

measure) 

are  equally  as  effective  for  stabilization  of  variance  and  will  have 

«« .  mi  +  i  . 

n     n 

However,  both  are  limited  with  the  problem  of  the  variance  largely  dependent  upon 
X  even  after  transformation  in  the  extreme  range  of  proportions.  Judicious  de- 
sign of  an  experiment  and  use  of  a  design  such  as  a  parallelogram  design  will 
somewhat  eliminate  this  problem.  It  is  frequently  useful  to  look  at  past  exper- 
imental material  or  preliminary  pilot  experiments  to  give  prior  indication  of 


this  problem.  Bartlett  (193'j)  has  proposed  an  adjustment  to  give  corrections 
for  such  extreme  responses.  This  adjustment  is 

Y(x)  m  arcain  «TI    where  0  <  x  <  1,   0  <  Y  <  90° 

KO)  .  arcain  jf. 

Y(l)  -  90  -  Y(o) 
For  practical  purpose  this  adjusted  transforraation  fulfills  the  requirement 
that  the  variance  is  Independent  of  the  mean  provided 

0.05  <  P  <  0.95    and    a  >  10 
and  is  constant  for  all  samples.  In  conclusion,  tables  are  readily  available 
for  application  of  the  angular  transformation  to  experimental  data. 

BB  PBQBH  TRA8SFQBMATI0H 

The  probit  transformation  lo  particularly  useful  when  applied  to  dosage- 
mortality  and  time -survival  data.  One  is  concerned  with  the  variables  of  the 
percentage  mortality  to  the  concentration  of  treatment.  The  graphical  repre- 
sentation of  these  two  variables  with  the  percentage  of  dead  organisms  on  the 
ordinate  to  the  concentration  of  dosage  or  treatment  on  the  abscissa  is  usually 
a  sigmoid  curve .  This  curve  resembles  the  letter  8,  with  the  change  in  per- 
klll  per  unit  of  the  abscissa  being  smallest  near  mortalities  of  0 

100  per  cent,  and  largest  near  50  per  cent. 

General  theory  implies  that  the  dosage -mortality  curve  is  primarily  de- 
scriptive of  the  variation  in  susceptibility  between  individuals  of  a  population. 
One  would  suspect  that  the  individual  lethal  dose  would  vary  from  individual  to 
individual  and  would  suspect  that  this  distribution  curve  would  be  normal  under 
uniform  conditions.  However,  it  is  not  easy  to  apply  experimental  techniques 
to  determine  this  lethal  dosage  because  all  «nir#.i,fl  more  susceptible  than  those 
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which  could  be  killed  with  a  smaller  dosage,  would  be  killed  along  with  those 
requiring  a  given  dosage.  This  requires  a  technique  where  the  dosage  is  varied 
and  determines  a  series  of  percentage  kills  for  proportionate  areas,  (  ■  *■ ■ ) . 
If  these  percentage  kills  are  then  plotted  on  the  ordinate  and  dosage  on  the 
abscissa,  one  would  obtain  a  cumulative  normal  frequency  distribution.  The  pro- 
portionate areas  are  those  of  the  categories  killed  or  not  killed.  The  assump- 
tion of  individual  susceptibility  can  be  easily  cheeked  by  experiment  and  com- 
parison with  the  theoretical  normal  curve  of  error.  The  expected  dosage  is  then 
given  in  terms  of  a  standard  deviation  about  a  mean  or  median  at  the  origin. 
However  this  system  would  introduce:  negative  expected  dosages  which  would  be  in- 
convenient . 

A  system  of  statistical  units  called  probits  or  probability  units  is  avail- 
able for  transforming  the  sigmoid  dosage -mortality  curve  to  a  straight  line. 
This  transformation  will  not  modify  the  proof  or  disproof  of  basic  assumptions 
and  is  one  obtained  by  equating  0  of  the  usual  stotistical  table  of  deviates 
to  the  digit  5  sad  then  the  deviates  of  the  normal  curve,  given  in  terms  of 
o  or  standard  deviations  are  added  to  this  unit  to  obtain  the  probit  correspond- 
ing to  each  percentage  kill.  tJometiuies  one  must  take  the  probit  transformation 
to  one  additional  step  if  the  variation  shows  a  geometrical  rather  than  arith- 
metical distribution.  This  possibility  can  be  tested  by  converting  the  observed 
dosages  to  logarithms  and  again  plotting  the  dosages  inferred  from  mortality  or 
probits  against  those  secured  experimentally.  This  double -trans format ion  will 
frequently  give  a  straight  line  relationship  and  often  times  give3  a  more  ade- 
quate measure  of  susceptibility. 

If  the  transformation  of  dosages  to  logarith.no  completes  the  transformation 
of  the  dosages -mortality  curve  to  a  straight  line,  it  is  an  index  to  the  inherent 


susceptibility  of  th,  individual  animal  to  the  poison.  There-  must  then  be  a 
direct  proportionality  between  the  concentration  of  the  poison  in  the  dose  ad- 
ministered and  the  «»unt  of  poison  fixed  by  th>  essential  tissues  of  the  ani- 
mal- Frequently  th  poisoning  of  an  individual  multicellular  animal  can  be 
attributed  to  the  death  of  a  proportion  of  the  essential  cells,  and  hence  the 
susceptibility  of  the  animal  as  a  whole  can  be  determined  by  the  average  suscep- 
tibility of  its  essential  cell3 .  Therefore  the  logarithmic  transformation  may 
be  sought  in  the  relation  between  the  dosage-  administered  and  the  amount  of 
poison  f 1 •  d  by  the  essential  celi3  or  tissues. 

One  of  the  principal  empirical  formulas  for  describing  the  absorption  or 
fi. cation  process  i3 

m 
where,  C  is  the  concentration  of  the  drug  (or  dosage),  X  «»  the  amount  fi  .  d 
in  the  organism,  m  -  mass  of  the  absorbing  constituents  within  the  organism, 
and  K  and  n  are  constants.  Since  m  is  essentially  constant  from  animal 
to  animal  one  can  combine  constants  and  reduce  the  abov.  formulas  to 

log  C  »  n  log  X  ■•»  K* 
Proa  this  it  is  apparent  that  there  is  a  linear  relation  between  the  logarithm 
of  the  concentration  and  tho  logarithm  of  the  amount  fi  m$  by  the  cells  of  the 
animal.  Ifcnce  the  observed  logarithmic  conversion  of  tho  dosage -mortality  curve 
is  not  duo,  to  use  of  the  amount  fled  in  the  tissue  as  the  true  individual  le- 
thal dose.  Another  absorption  equation  is  somewhat  more  consistent  as  far  as 
the  middle  and  higher  kills  and  dosages  are  concerned  in  the  logarithrarprobit 
transformation  • 

The  first  estimate  of  the  transformed  dosage -mortality  curve,  the  provi- 
sional regression  line,  is  ordinarily  not  calculated  but  represents  the  best 
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-nt  of  the  experimenter.  Sonet  iocs  the  experimenter  will  want  to  calcu- 
late this  first  estimate  if  the  observations  appear  to  be  widely  scattered. 
Occasionally  the  initial  transformed  dosage -mortality  curve  will  serve  the  needs 
of  the  experimenter  if  the  data  is  uniform. 

Fitting  of  a  dosmge-wortality  curve  is  an  attempt  to  infer  from  a  given 
experiment  the  conditions  obtaining  in  a  class  or  species  of  organisms,  the  cal- 
culated regression  line  of  the  dosage  probit  diagram  is  the  most  accurate  esti- 
mate which  can  be  drawn  fro*  the  data,  provided  that  basic  assumptions  are  cor- 
r  et.  In  some  cases  a  first  approximation  is  sufficient  but  frequently  it  will 
only  represent  a  rather  important  correction  of  initial  estimates.  Each  sepa- 
rate! observation  can  be  weighted  accurately  and  limits  can  be  determined  about 
the  calculated  regression  line .  Th.  regression  line  has  the  for* 

I  -  a  ♦  b(X  -  ") 
where,  Y  ■  the  mortality  in  probits  on  the  transformed  dosage  mortality  curve 
corresponding  to  any  given  dosage  X,  a  *  y  »  numerical  average  probit  for  all 
determinations  in  port  of  experiment  being  fitted  by  a  straight  line,  £  »  the 
average  dosages  in  logarithms,  and  b  is  the  regression  efficient  or  slop? 
of  the  line.  The  necessary  calculations  are: 

;  .  £(}&1  y  »  Sfccd  b  ,  fc^y)  -  x  L(wy)  and 

2Xv)  E(w) 

A  -  E(wx*)   -  I  £(w:) 

v  »  weight  of  a  given  observation,  the  product  of  the  weighting  coefficient 

multiplied  by  th-  no.  of  killed  plus  survived, 
x  ■  a  function  of  the  experimental  dosage,  usually  its  logarithm. 
y  »  the  probit  corresponding  to  th  observed  percentage  mortality. 
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The  regression  line-  will  always  pass  through  the  point  x,  y  and  hence  fix 
the;  degree  of  the  susceptibility  to  a  to.<lc  agent  shown  by  the  population  as  a 
whole.  Fron  a  statistical  viewpoint,  b  is  the  slope  or  the  tangent  of  the 
angle  with  which  the  regression  line  will  pass  through  the  point  established  by 
I  *aA    y.  From  a  biological  viewpoint,  b  measures  how  closely  the  individual 
organisms  in  the  experiment  agree  with  one  another  in  their  sensitivity  to  the 
toxic  agent.  It  Is  convenient  to  expreae  this  toxicologic©!  characteristic  as 
the  percentage  increase  in  dosage  that  la  required  to  increase  kill  by  one  pro- 
bit.  This  ia  the  ratio  of  100  Iog  10  to  b  or  339 'S6  . 

c  b 

One  Is  of  course  concerned  about  how  close  the  sample  evidence  agrees  with 
the  true  situation.  There  is  a  X8  t  st  available  for  comparing  how  well  this 
true  relationship  Is  appro  cimatod.  It  involves  few  computations  beyond  thorns 
required  for  determining  the  regression  equation.  The  first  consideration  ia 
to  determine  whether  the  observed  mortalitiea  agree  with  the  assumption  of  a 
rectilinear  relationship  on  the  logarithmic  probability  acale  within  limits  of 
sampling  error.  Each  dosage  observed  la  compared  with  the  expected  dosage  from 
the  regression  equation,  but  instead  of  calculating  separately  each  expected 
probit  (mortality)  and  then  subtracting  it  from  the  observed  probit  (mortality), 
a  short  cut  method  adopted  by  Fisher  may  be  used.  The  Xs  may  be  calculated 

X*  -  [  I(wy*)  -  y  2<wy)}  -  b(  Kwry)  ■  *  Z(vy)J 
and  it  depends  upon  its  degrees  of  freedom  n».  Sometimes  determination  of  the 
proper  degree  of  freedom  la  not  always  straight-forward.  Procedures  are  also 
available  for  determining  the  variances  of  position  and  slope.  Additionally  one- 
can  calculate  s  zone  of  error  of  the  regression  line  to  supplement  the  calculated 
regression  line  which  is  the  best  available  estimate  of  the  true 
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curve .  One  then  can  make  probability  statements  about  whether  the  stone  encloses 
the  true  dosage-mortality  curve  vhen  transformed  to  the  logarithmic-problt  dia- 
gram* the  case  of  zero  survivors  is  an  important  problem  in  the  case  of  surviv- 
al data.  This  problem  has  been  considered  by  R.  A.  Fisher.  For  experiments 
with  no  survivors,  x  «  oo  with  weight 

zfi 

~-  ->Zx-?0 

for  large  samples.  The  equations  appropriate  for  plotting  the  points  on  the 
probit  diagram,  namely 

and    A    f      e"V  -  q 


n  &r 
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cannot  be  used  in  this  form  for  fitting  the  regression  line  when  the  number  of 
survivors  is  small.  One  must  apply  the  Method  of  Maximum  Likelihood  to  such 
cues .  By  introduction  of  a  weighting  deviate  a  procedure  is  then  available  so 
•jqaeriments  with  few  or  no  survivors  any  exert  their  proper  influence  in  adjust- 
ing the  regression  line .  This  correction  is  necessary  because  the  omission  of 
experiments  because  they  show  no  survivors  would  constantly  bias  the  estimates 
in  the  sense  of  exaggerating  the  number  of  survivors  to  be  expected. 

In  general  the  probit  technique  involves  the  assumption  that  susceptibility 
is  distributed  normally,  with  inferred  dosages,  in  terms  of  units  called  probits, 
plotted  against  the  logarithms  of  their  corresponding  observed  dosages  giving  a 
straight  line  relationship.  The  procedure  or  transformation  is  particularly 
adaptable  and  statistical  methods  arc  available  for  considering  0  or  100  per 
cent  kills,  for  determining  a  weight  proportional  to  its  reliability,  for  comput- 
ing the  position  and  slope  of  the  transformed  dosage -mortality  curve,  for  measur- 
ing the  goodness  of  fit  of  the  regression  line  to  the  observations  by  the  X* 
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test,  and  for  calculating  the  error  In  position  and  in  slope  and  their  combined 
effect  at  any  log-dosage .    Tables  are  available  for  determining  the  probit  values 
In  the  probit  or  log-problt  transformation. 

23AB3P0«i-IftII0HS  WITH  FRACTIGBAL  POWEHS  OF  THH  VARIABUC 

Frequently  it  is  advantageous  to  transform  a  non-nornal  variable  to 
approximately  a  normal  variable  for  certain  forms  appearing  in  the  analysis  of 
variance.    This  i3  convenient  because  the  normal  distribution  has  been  so  exhaus- 
tively investigated  and  tabulated  and  also  because  certain  models  specify  a  nor- 
mal distribution  of  variates.    A  transformation  of  importance  is 

y  »  X*         *bere  0  <  r  <  1 

and    X     ia  a  non-negative  variate.     One  might  also  consider  the  transformation 
to  a  Type  III  variate  and  the  use  of    (x  +  o)r    where    o    is  a  random  variable 
in  the  interval    to,  l] .    One  can  approach  the  problem  by  observing  the  effect 
of  teking  the  1/  rth  power  of  a  normal  variate  rather  than  considering  the  effect 
of  the  rth  power  of  a  general  variate    X.    Therefore  ve  would  have 

.  =  4\  b  &  2  .:.)     , 
where    Z    is  a  normal  variable  with  mean  aero  and  standard  deviation  unity. 
This  would  ;aakc    X    a  normal  variable  with  mean    ■    and  coefficient  of  varia- 
tion (the  standard  deviation  divided  by  the  mean)  equal  to    K '     •     One  assumes 
that    m    is  large  and  positive  and    k    is  small  so  that 

F{X2$0J<€ 
and     e     is  extremely  small.    Ve  can  find  the  monenta  by  evaluation  of 

I^X*)  -  m^d  +  kl/f  2  zf 
where    t    is  greater  than  unity  and  has  both  integral  and  fractional  values. 
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for  transformations  of 

r  -  1/  2,  1/  3,  1/  *►,  •  •  • 
the  values  of  t  needed  will  always  be  Integral  and  thus  exact  values  of  the 
uts  X2,  X  ,  X  ,  .  .  .  aay  be  obtained.  From  here  the  moments  about  the 


and  the  values  of  the  moment  ratios  can  be  obtained.  If  r  is  not  of  this 
•logic  form,  the  numerator  of  the  fraction  Is  not  unity,  and  hence  t  vill  be 
fractional  and  the  moments  of  the  transform  variate  can  only  be  found  as  a 
potter  series  in  k.  This  limitation  is  not  of  serious  consequence  if  k  Is 
small  and  one  can  omit  the  higher  powers  of  the  expansion.  The  results  of  the 
previous  discussion  have  been  carried  out  by  (Moore,  1957)  and  are  given  in  the 
following  table  forra  as  functions  of  k.  The  first  cases  are  approximations  while 
the  last  three  are  exact.  These  can  be  graphed  and  from  diagramatic  form  one 
would  see  that  a  change  of  r  would  possibly  radically  alter  the  value  of  k 
even  though  the  (p.,  ftj  points  of  the  transf ormod  variate  may  be  very  close . 

For  a  practical  application  of  the  transformation  one  would  make  the  first  two 
aooents  of  the  transformed  variate  equal  to  those  of  a  normal  curve  to  insure 
a  correct  value  of  k  was  being  utilised.  It  is  frequently  useful  to  examine 
the  coefficient  of  variation  of  the  transformed  variate,  xV  r.  One  can  then 
determine  if  the  variable  has  both  the  correct  momenta!  ratios  and  the  correct 
coefficient  of  variation  to  be  transformed  into  an  approximate  normal  distribu- 
tion. Essentially  the  same  procedure  can  be  used  for  transformation  to  a  Xs 
or  Type  HI  distribution.  One  would  have  to  utilize  the  moments  for  determining 
the  proper  constants.  One  should  also  consider  the  coefficient  of  variation  c 
aa  well  as  the  appropriate  values  of  $L     and  ft.  for  determination  of  the 
proper  particular  transformation.  Many  other  distributions  can  be  handled  in 
a  similar  fashion. 
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Table  1 
Momenta!  Ratios  of    X1/ r    if    X    is  Horraal 


"I  »2 


3/  h         k(l  +  0.7963k  +  0. 33^763**)  3(l  +  0.lU8U»8k  +  0.021602k2  + 

0.0O3O86k3) 

2/  3    2.25k(l  +  0.29l667k  +  0.061*23**)  3(l  +  0.6k  ♦  0.l6k*) 

1/  2    8k(3  +  k)*(2  ♦  k)"3  3(^  ♦  20k  +  5k2)(2  +  k)"2 

1/  3    108k(3  ♦  16k  +  15k*)*(3  ♦  12k  ♦  3  +  72k(7  ♦  *»8k  +  75k8  •  15k3)(3  + 
5k2)"3  12k  +  5k2)"2 

1/  fc    l62k(2  ♦  32k  ♦  U9k*  ♦  198k3  ♦  3  ♦  (*U2%  ♦  94201k2  ♦  70735k3  ♦ 

33k  )a  x  (2  +  21k  ♦  Wk2  +        2042501k*  +  1880525k5  +  235008k6) 
12k3)"2  1  8(2  +  21k  +  48k*  +  12k3)"2 


INVERSE  HJffERBGLle  SHIH  TRANSFORMATION 

In  certain  entomological  experiments  the  variate  nay  be  one  such  that  it 
cannot  strictly  be  subjected  to  the  analysis  of  variance.  It  is  of  concern 
then  to  determine  how  the  variable  may  be  transformed  so  that  the  analysis  of 
variance  becomes  applicable. 

One  problem  in  entociological  experlmentn  is  vith  a  variate  X  which  would 
be  the  number  of  insects  in  one  of  a  group  of  small  contiguous  areas,  say  plots, 
within  a  larger  area,  say  a  block.  If  we  let  the  expectation  of  X  over  all 
these  plots  be  K  and  the  standard  deviation  be  0 .  then  over  a  number  of 
these  larger  areas,  the  counts  of  insects  arc  distributed  in  a  completely  ran- 
dom fashion,  from  the  Poisson  distribution,  o2  «  H.  However,  situations  occur 
due  to  nature  and  changes  are  present  in  expectation  from  plot  to  plot  within 

a  olock.  Therefore,  o2   will  tend  to  be  greater  than  M  and  we  can  only 
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describe  thie  variance  by  the  statement 

O*  -  f(M)   . 
This  function,  f(M),  must  be  considered  carefully  since  it  determines  the 
transformation  vhich  may  be  developed  to  make  the  standard  deviation  independ- 
ent of  the  mean. 

One  proposal  to  the  problem  is  to  state, 

0*  »  KM  , 
vhere  K  is  a  constant.  However,  in  field  data  there  does  not  tend  to  be  a 
linear  relationship  between  o*  and  M  or  their  respective  estimates  sa  and 
xt    but  there  tends  to  be  a  disproportionately  greater  departure  as  I  increases, 
The  preceding  considerations  would  suggest  that 

o*  -  N  .«*  M 
la  generally  untrue .  The  curvilinoarity  might  be  represented  by 

o*  -  M  oCl^ 
vhich  leads  to  the  relationship  that 

o*  *  M  +  ktf 
vhere  k  is  a  suitable  constant.  It  will  then  be  noted  that 

K  -  (0s  -  M>M* 2 
is  the  Charlier  coefficient  of  disturbance  from  a  Poissou  distribution.  If 
cooparisono  are  made  to  consider  the  suitability  of  the  relationship 

o*  «  M  +  krf8  , 
they  vill  involve  finding  hov  they  fit  observations  on  sc  and  x.  The  exact 
fit  is  difficult  to  determine  and  it  is  nccesaary  to  fall  back  upon  pairs  of 
estimates  x  and  s£  from  the  data  for  the  empirical  determination  of  K 
k.  Beull  (1^2)  estimates 

K  m  L  s8/  L  x  ,    k  »  (I.  s»  -  Z  x)/  Z  x2 
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where  Z  represents  the  summation  over  all  pairs. 

Upon  application  to  field  data  Beull  has  found  that 
ff2  ■  M  +  kK2 
is  generally  e  better  approximation  to  the  form  of  f (m)  and  makes  it  prefer- 
able to  proceed  vith  the  analysis  of  data  from  this  assumption.  The  transfor- 
mation appropriate  to  this  type  of  data 

X'  -  k"1/ 2  sinh"1  (kx)l/2 
was  developed  by  the  method  of  Tlppett  (l93^) • 

The  adequacy  of  this  transformation  is  Judged  by  the  extent  to  which  it 
stabilizes  variability.  If  in  the:  transformation  we  express 

sinh*1  (kx)1/  2  when  kx  <  1     , 

as  a  well  known  series,  we  have 

where  for 

k  -  0  ,    X«  =  iH*     . 

For  large  values  if  kx,  X»  varies  almost  as  log  X  or  as  log  (X  +  l)  so 
the  above  expansion  for  practical  purposes  embraces  the  root  and  logarithmic 
transformations.  Beall  (19^2)  gives  a  table  for  the  transformation  for  a  prob- 
able range  of  observations  of  X  and  k  at  intervals  close  enough  for  practi- 
cal purposes.  Values  of  X1  outside  those  can  be  calculated  from 

X'  -  kl/  2  loge  {(kx)1/  2  +  (1+  kx)17  2]   . 

When  using  field  data  in  making  the  tranpforration,  it  is  necessary  to 
estimate  the  value  of  k  empirically  by 

k-  (Ls2  -LxJ/Zx2 
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for  which  estimates  x  of  the  mean  and  s  of  the  standard  deviation  must  he 
found.  The  proposed  transformation  besides  making  the  variability  within  a 
block  for  a  repeated  treatment  the  same  for  all  treatments  and  block,  should 
also  provide  quantities  satisfying  the  assumptions  underlying  the  analysis  of 
variance.  That  is  in  the  analysis  of  variance  the  chance  variability  for  each 
plot  shall  be,  when  the  effects  of  block  and  of  treatment  are  removed,  normally 
distributed  with  a  standard  deviation  common  to  all  plots,  in  which  of  course 
the  standard  deviation  of  the  chance  variability  for  a  given  plot  is  independent 
of  the  expectation  for  that  plot.  Diagramatically  this  can  be  compared  and  the 
proposed  transformation  should  tend  to  make  the  standard  deviation  independent 
of  the  mean  in  accordance  with  the  assumptions  underlying  the  analysis  of  var- 
iance. 

Several  results  may  be  noted  or  compared  to  suggest  that  the  transformation 
proposed  did  tend  to  make  the  variability  within  a  given  treatment  independent 
of  the  mean  for  that  treatment:  Some  are:  l)  Indication  after  transformation 
that  the  residual  sum  of  squares  about  the  regression  was  greater  than  the  re- 
duction in  squares  due  to  regression,  whereas  it  would  be  consistently  less 
before  trans format i6n.  2)  As  can  be  seen  above,  the  regression  generally  did 
not  effect  a  significant  reduction  in  variability  after  transformation  but  did 
before  (the  small  number  of  degrees  of  freedom  made  high  significance  difficult 
of  attainment).  3)  After  transformation  the  sign  of  the  regression  will  be  a 
chance  matter,  whereas  before  transformation  it  was  consistently  positive.  This 
result  suggests  that  the  proposed  transformation  did  tend  to  make  the  variability 
within  a  given  treatment  independent  of  the  mean  for  that  treatment. 

The  mathematical  basis  for  the  proceeding  transformation  was  suggested  by 
th  -  method  used  by  Tippctt  ( 193*0  •  The  procedure  is  as  follows .  It  is  required 


32 


to  find  I*  *  f(x)  such  that  the  standard  deviation  c   '  of  X',  shall  he 

approximately  constant.  We  could  have 

X*  •  f(M)  +  f'(M)(X  -  M)  +  .  .  . 
where  H  ia  the  expectation  of  X  and  hence  approximately 

(X»  -  M»)  «  i'(li)(X  -  M) 
where  M1  is  the  expectation  of  X1.  Hence 

where  o  is  the  standard  deviation  of  the  observations,  X.  Replacing  ox, 

in  the  above  equation  by  a  constant,  C,  and  substituting  for  a    we  have 

f»(M)  m  C(M  +  kM6)"1/2 
where  k  is  as  has  been  previously  discussed  a  constant  peculiar  to  our  data. 
Integrating  the  above  equation  one  obtains 

f(M)  -  ack"1/  2  sinh"1  (kM)1^  2  . 

From  this  integration,  the  proper  transformation  appears  to  be  of  the  fore 

,  .-1  ,.    a/ 2 
sinh      (kx)   ' 

but  it  is  wise  instead  to  use 

»*/  2   sin*,"1  (Kx)1/  2 

because  this  transformation  becomes  identical  with  the  established  transformation 

1/  2 
X  '  ,  when  k  *  0.  The  derivation  is  such  that  the  transformation  can  only  be 

justified  if  the  application  is  found  satisfactory.  The  transformation  should 

then  put  the  data  in  a  form  where  the  standard  deviation  approaches  a  constant 

independent  of  the  mean.  The  estimation  of  the  constant  k  can  be  facilitated 

by  the  design  of  an  experiment  with  a  repetition  of  treatments  within  blocks. 
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MODIFIED  AND  SPECIAL  TRANSFORMATIONS 

Frequently  biological  populations  change  proportional  to  the  mean,  which 
infer  that  the  changes  would  be  independent  of  the  mean  on  the  logarithmic 
scale.  This  situation  might  be  encountered  in  field  plot  experiments  where 
the  variate  of  interest  would  be  the  percentage  of  area  covered  by  a  specific 
crop.  Often  times  an  efficient  comparison  will  involve  a  regression  of  this 
percentage  area  covered  with  time.  If  the  percentage  areas  are  less  variable 
when  near  0  (or  100 )  than  when  in  the  middle  of  the  range,  a  suitable  trans- 
formation might  lead  to  improvement  of  the  data.  Bartlett  (±9kj)   has  found  that 
the  transformation 

y  »  log  (x/  (100  -  x)) 
where  x  is  a  percentage  variate,  summarized  the  time  change  and  has  the  effect 
of  changing  the  regression  lines  at  0  on  the  original  scale.  Analysis  of 
variance  would  then  be  made  upon  linear  regressions  of  the  percentage  variate 
with  time.  The  transformation  is  an  empirical  one  and  rationalization  of  its 
use  is  implied  by  noticing  that  the  change  in  percentage  area  depends  upon  the 
amount  of  plant  species  as  well  as  the  available  area  for  expansion,  or  the 
competition  of  species  causing  it  to  decrease.  For  a  constant  cause  of  change, 
the  percentage  area  X  would  depend  upon  an  equation 

f*  =  Ax(l00  -  x)   . 

By  integration 

loS  im"  v   -  Bt  +  C   or   X  =  t-— i22~ -y 

100  -  x  1  +  exp  -(Bt  +  C) 

which  is  the  formula  for  the  law  of  population  growth. 

Transformation  of  the  sample  correlation  r  will  often  make  its  disturbance 
less  skew  and  more  stable  in  variance .  Because  the  variance  of  a  correlation 
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coefficient  is  approximately 

(i  -  p2)2/  ft  -  X    , 
VbUPa  p  is  the  true  value  of  the  coefficient  and  ft  is  the  number  of  observa- 
tions in  the  sample,  one  obtains  the  transformation 

Z  -  1/2  log  [(1  +  r)/  (1  -  r)} 
to  make  the  variance  independent  of  the  mean.  Although  rare,  it  is  possible 
to  analyze  correlation  coefficients  by  the  analysis  of  variance,  and  the  above 
transformation  would  be  the  appropriate  one. 

The  above  transformation  is  of  use  because  often  reliable  results  are 
useful  to: 

(1)  Test  if  an  observed  correlation  differs  significantly  from  a  theoretical 
value. 

(2)  Test  if  two  observed  correlations  are  significantly  different. 

(3)  Combine  a  number  of  independent  estimates  (if  available)  into  an  improved 
estimate . 

(k)     Perform  tests  (l)  and  (2)  with  such  average  values. 

This  transformation  has  the  following  characteristics.  It  leads  approximately 
to  a  normal  distribution  where  tests  may  be  carried  out  without  difficulty.  As 
r  changes  from  0  to  1,  Z  will  pass  from  0  to  oo .  For  small  values  of 
r,  Z  is  nearly  equal  to  r,  but  as  r  approaches  unity,  Z  increases  with- 
out limit.  For  negative  values  of  r,  Z  is  negative.  One  advantage  of  this 
trans  format  ion  of  r  into  Z  lies  in  the  distribution  of  these  two  quantities 
in  random  samples.  The  standard  deviation  of  r  depends  on  the  true  value  of 
the  correlation  p,  as  is  seen  from  the  formula 

.    1  -  o2 

o  = c— 

r 
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Since    p    is  unknown,  we  have  to  substitute  for  it  the  observed  value    r,     and 
this  value  is  noc  a  very  accurate  estimate  of    p     in  suall  samples.    The  stand- 
ard error  of    Z    is  simpler  in  form,  ai  ly 

1 
n»  -  3 

sad  is  practically  independent  of  the  value  of  the  correlation  fro;a  which  the 
Maple  was  drawn.  Secondly,  the  distribution  of  r  is  not  normal  in  small 
Maples,  and  even  remains  far  from  normal  for  lar©3  samples  and  high  correlations. 
She  distribution  of  Z    tends  to  normality  for  large  Mmplee  and  any  value  of 
the  correlation.  Finally,  I  changes  rapidly  in  distribution  as  p  changes, 
whereas,  S  is  nearly  constant  in  distribution  and  accuracy  can  be  Improved  by 
small  corrections  for  departure  from  normality  although  they  are  not  necessary. 
ThorcCor"  one  con  assu-ae  that  Z    is  normally  distributed  with  sufficient  accu- 
racy in  many  cases. 

SUMttRY  AHD  COHCUUSIONS 

The  aelection  of  a  transform  scale  of  measurement  will  depend  upon  (l) 
The  nature  of  the  data,  and  (2)  the  statistical  procedures  to  be  used.  Cho 
of  an  appropriate  transformation  depends  on  the  nature;  of  the  original  variate. 
Often  times  on«  has  to  apply  a  transformation  and  then  arte  appropriate  tests 
to  determine  the  effectiveness  of  the  transformation,  Sonotimes  it  is  not  pos- 
„o  normalise  the  data  for  certain  types  of  variates. 

It  may  be  Impossible  to  find  a  transformation  that  results  in  honoaanaous 
error  variances.  However,  if  enough  sample  evidence  is  available,  it  is  usually 
possible  to  determine  a  relationship  between  means  and  variances  and  to  mate 
them  independent  by  an  appropriate  transformation.  The  standard  or  basic 
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transformation  are  the  square  root,  logarithmic ,  and  angular  or  arcsine  trans- 
formation. Many  variations  of  these  basic  transformations  are  available. 
Transformations  of  this  type  have  achieved  new  importance  when  considered  with 
experimental  designs  and  the  analysis  of  variance.  Bartlett  (19^7)  has  the 
following  suggested  summary  of  appropriate  transformations. 
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ABSTRACT 

The  method  of  transformino  random  variates  arises  from  a  need  to  satisfy 
certain  assumptions  used  in  the  analysis  of  variance  for  numerical  data.  The 
procedures  are  particularly  applicable  if  the  variance  and  the  mean  of  the 
original  variate  are  functionally  related.  Application  of  an  appropriate  trans- 
formation is  facilitated  if  this  functional  relationship  is  known. 

This  report  considers  some  of  the  mathematical  theory  and  expected  results 
when  a  transformation  is  applied.  A  transformation  considered  and  of  interest 
is  the  square  root  transformation  for  a  variate  with  a  Poisson  distribution  and 
with  the  variance  proportional  to  the  mean.  This  type  of  variate  is  encountered 
in  enumeration  data,  with  counts  for  the  variable.  The  square  root  transforma- 
tion should,  in  this  situation,  produce  variances  independent  of  means. 

A  logarithmic  transformation  of  the  original  variate  is  useful  if  the  stand- 
ard deviation  of  the  variable  tends  to  vary  directly  as  the  mean  or  if  the  ef- 
fects in  an  experiment  are  proportional  to  the  mean  instead  of  additive.  Graph- 
ical comparisons  and  applications  of  Tukey's  test  for  non-additivity  are  helpful 
to  determine  if  the  transformation  has  provided  the  desired  result. 

An  angular  transformation  is  of  consequence  if  the  original  variate  tends 
to  be  distributed  in  a  binomial  fashion  and  if  the  proportions  tend  to  vary  over 
a  rather  wide  range.  An  angular  or  inverse  sine  transformation  should  produce 
a  variable  with  rather  constant  variance  if  it  is  effective.  The  Probit  trans- 
formation is  applicable  to  dosage -mortality  and  time-survival  data  in  biologi- 
cal experiments.  The  probit  technique  is  used  in  relation  to  regression  tech- 
niques to  determine  lethal  dosage  in  such  experiments. 
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A  fractional  power  transformation  is  useful  in  transforming  a  variate  to 
a  normal  or  other  revelant  distribution  about  which  results  are  obtainable. 
The  inverse  hyperbolic  sine  transformation  is  applicable  in  entomological  ex- 
periments where  there  is  a  curvilinear  relationship  between  the  mean  and  var- 
iance. This  transformation  should  produce  a  standard  deviation  that  approaches 
a  constant  independent  of  the  mean  if  an  analysis  of  variance  is  to  be  meaning- 
ful. 

Modified  and  special  transformation  are  applied  in  special  situations. 
A  number  of  these  transformations  are  considered  in  this  report.  Finally 
there  is  a  summary  of  transformations  used  in  particular  situations. 


